The modified Newtonian dynamics — MOND 
and its implications for new physics 
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No more salient issue exists in contemporary astrophysics and cosmology than that of the elusive 
"dark matter". For many years already Milgrom's paradigm of modified Newtonian dynamics 
(MOND) has provided an alternative way to interpret observations without appeal to invisible 
dark matter. MOND had been successful in elucidating economically the dynamics of disk galaxies 
of all scales, while doing less well for clusters of galaxies; in its original form it could not address 
gravitational lensing or cosmology. After reviewing some of the evidence in favor of MOND, I 
recollect the development of relativistic formulations for it to cope with the last deficiency. I comment 
on recent work by various groups in confronting TeVeS, a relativistic embodiment of MOND, with 
observational data on gravitational lensing and cosmology. Throughout I ask what sort of physics 
can be responsible for the efficacy of MOND, and conclude with an appraisal of what theoretical 
developments are still needed to reach a full description of the world involving no unobserved matter. 



1. Introduction 

Newtonian gravity theory has served physics and tech- 
nology faithfully for well over three centuries. Neverthe- 
less, it has long been known that it is only an approxi- 
mation to a relativistic gravitation theory, usually iden- 
tified with Einstein's 1915 general relativity (GR). GR 
has correctly predicted subtle effects in the dynamics of 
the solar system, in the celebrated Hulse- Taylor double 
pulsar, and has anticipated the existence of those exotic 
denizens of the universe, black holes, now confirmed by 
myriad observations of galactic nuclei, compact galactic 
X-ray sources, etc. The nonlinear aspects of gravitation 
in GR are crucial to these new settings: gravitation is 
more difficult than Newton supposed. But up until a few 
decades ago astronomers took comfort in the belief that 
if one sidesteps situations with extremely strong grav- 
itational fields, or with extremely rapid motions, then 
Newtonian theory is a good approximation to the truth. 

But is that so? Consider the situation in the realm of 
galaxies. Speeds are low compared to light's, and gravita- 
tional fields are weak there. So Newtonian physics should 
describe motions extremely well. Yet the accelerations of 
stars and gas clouds in the outskirts of spiral galaxies, 
and of galaxies swimming in the large clusters of galax- 
ies, well exceed Newtonian predictions made on the basis 
of the matter actually visible in these systems. Those ac- 
celerations are inferred by combining Doppler measured 
velocities with geometric assumptions, e.g. that in a par- 
ticular spiral galaxy stars move in circles on a plane. The 
geometric assumptions can be checked, at least statis- 
tically, and the acceleration discrepancy is found to be 
real and generic. Sometimes the problem is characterized 
thus: rotation or random velocities are much bigger than 
expected. I stress acceleration because it is the quantity 
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which directly measures the strength of the gravitational 
field. In any case, what is wrong? 

It was realized by Ostriker and Peebles [l[ that mas- 
sive halos would stabilize disk galaxies against the bar 
forming instabilities which, in Newtonian theory, are en- 
demic to systems with little velocity dispersion. Since 
about third of the spiral galaxies have no significant bar, 
this justified the hypothesis that disk galaxies (including 
spirals) are often embedded in massive but invisible ha- 
los. Thus started the trend to resolve the acceleration 
discrepancy by postulating the existence of much dark 
matter (DM) in systems ranging from the very tenuous 
dwarf spheroidal galaxies with visible masses ~ 1O 7 M0 to 
the great clusters of galaxies with observed masses in the 
1O 14 M ballpark 0, in brief in any system where an ac- 
celeration discrepancy exists. The DM's role is to provide 
the missing gravitational pull to account for the excessive 
accelerations (an easy introduction to the DM paradigm 
is provided by Khalil and Munoz 0]). But thirty years of 
astronomical exploration and laboratory experiments Q 
have yet to provide independent evidence of DM's exis- 
tence, e.g. 7 rays from its decay. Is there an alternative 
to DM? This review focuses on the modified Newtonian 
dynamics scheme put forward by Milgrom [5j, [fj, |7j as an 
alternative to the systematic appeal to DM. 



2. Sharpening the problem 

To pass judgment on DM and alternatives to it, it is 
well to take stock of the two overarching empirical facts 
in the phenomenology of disk galaxies. First, ever since 
the work of Bosma as well as Rubin and coworkers Q , it 
has been clear that gas clouds in the disks of spiral galax- 
ies, which serve as tracers of the gravitational potential, 
circle around each galaxy's center with a (linear) veloc- 
ity which first rises as one moves out of the center, but 
hardly drops as the radius grows to well beyond the vis- 
ible disk's edge. Yet a falloff of the velocity with radius 
was naively expected because, to judge from the light 
distribution in spiral galaxies, most of their visible mass 
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is rather centrally concentrated. Newtonian gravitation 
would thus predict that "rotation curves" (RC) should 
drop as r -1 / 2 outside the bright parts of these galaxies, 
but this is not seen in over a hundred extended RCs mea- 
sured with sufficient precision [ToL [ll[ • In fact in many 
spiral galaxies, particularly those possessing high surface 
brightness, the extended RC becomes fiat away from the 
central parts (FIG. [TJ. 
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FIG. 1: Collage of RCs of nearby spiral galaxies obtained 
by combining Doppler data from CO molecular lines for the 
central regions, optical lines for the disks, and HI 21 cm line 
for the outer (gas) disks. The rotation velocity in units of km 
s _1 is plotted vs galactocentric radius R in kiloparsecs (kpc); 
1 kpc « 3000 light years. It is seen that the RCs are flat to well 
beyond the edges of the optical disks (~ 10 kpc). Graph from 
Ref. |^, reprinted with permission from the Annual Review of 
Astronomy and Astrophysics, Volume 39 (c)2001 by Annual 
Reviews www.annualreviews.org 

Second, as originally pointed out by Tully and Fisher 
(TF) [12], the rotation velocity V ro t and the blue band 
luminosity Lb of a galaxy are simply correlated (FIG.[2]). 
In a more informative form [xo| . the TF law states that 
for disk galaxies the luminosity in the near infrared band, 
Lk 1 (a good tracer of stellar mass) , is proportional to the 
fourth power of the rotation velocity in the flat part of 
the RC, with a universal proportionality constant. 

The flat extended RCs have provided the logic under- 
pinning the DM paradigm as follows. In the flat part 
of a RC, the rotation velocity is r independent, so the 
centripetal acceleration goes as 1/r. Thus in the plane 
of the galaxy the gravitational field must be decreasing 
as 1/r. According to Poisson's equation of Newtonian 
theory, such a gravitational field, if assumed spherically 
symmetric, must be sourced by a mass distribution with 
1/r 2 profile (isothermal sphere model). Since the visible 
mass density (in stars and gas) in the inner disk drops 
much faster than this, it is consistent to assume that the 
total mass distribution in the outer parts is quasispher- 
ical. The conclusion is that each spiral galaxy must be 
immersed in a roundish DM halo with mass density pro- 
file tending, at large r, to 1/r 2 . 

But questions plague the halo hypothesis. The halo 
DM, though much searched for, has never been detected 
directly [4j . Cosmogonic simulations of DM halo forma- 
tion within Newtonian physics predict that such a halo 
would have a density profile behaving like 1/r for small 
r and gradually steepening to 1 /r 3 asymptotically. Only 
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FIG. 2: The TF correlation for a sample of galaxies as a log- 
log plot of the blue band luminosity Lb in units of 10 10 Lq vs. 
the asymptotic rotational velocity Kot in km/s. The straight 
line (mine) has slope exactly 4 and corresponds to MOND's 
prediction, Eq. ([2]), for a reasonable mass (in units of Mq) 
to Lb ratio Tg = 3. Graph reproduced from Ref. [ll| by 
permission of the American Astronomical Society. 



by fine tuning the halo's parameters is it feasible to match 
this kind of profile to the observed flat RCs as well as 
would the isothermal sphere (the need to fine tune pa- 
rameters of a halo model in order to fit the shape of the 
galaxy's RC was already clear long ago [13] )■ And the 
predicted 1/r "cusp" in the density profile is also obser- 
vationally problematic 0]. 

The DM halo hypothesis is evidently an attempt to re- 
solve the acceleration discrepancy within orthodox grav- 
itation theory. But in the coming to terms with this 
discrepancy, suspicion fell on Newtonian gravity already 
early in the game. Zwicky, who had exposed the accel- 
eration discrepancy in clusters of galaxies flol ]. opined 
much later that the discrepancy may reflect a failure of 
conventional physics fl6| . Concrete proposals for depar- 
tures from the Newtonian inverse square law in various 
settings were put forward by a number of workers p7| . 
All of these still regarded gravitation as a linear inter- 
action with the strength of the field proportional to its 
source's mass. 

As Milgrom realized |5|, such a modification of the 
gravity law, if relevant on the scales of galaxies, is in- 
compatible with the TF law. For one it would imply 
that a mass mi at T\ generates at the acceleration 
field Tn\g(r2 — with g depending only on fundamen- 
tal constants and on the relative positions of source and 
field points in accordance with the principle of homogene- 
ity of space. Then, obviously, for matter orbiting with 
speed V m t at radius R in the outskirts of a galaxy (with 
its source mass, M, concentrated at a fairly definite dis- 
tance R), the acceleration, V mt 2 /R should be equal to 
M\g(R)\. This would require not only |ff(iZ)| oc J? -1 , 
but also M oc V TOt 2 , the last incompatible with the TF 
law. Then again g(j2 — T"i) with the correct dimen- 
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sions, LT _2 M _1 , cannot be built by using only G, Ti — r\ 
and the scale of length £ at which gravity departs from 
Newtonian form. And inclusion of mass in the construc- 
tion (presumably the source's mass) would be against the 
spirit of linearity. The bottom line is that linear gravity, 
even if non-Newtonian, is incompatible with the TF law. 

3. The MOND paradigm 

Milgrom 0, @] proposed a novel paradigm which 
can be interpreted as reflecting non-Newtonian as well 
as nonlinear character of gravity already at the nonrela- 
tivistic level. He called the scheme Modified Newtonian 
Dynamics (MOND). The essential part of it is the rela- 
tion 

(j,(\a\/a )a = -V<S> N (1) 

between the acceleration a of a particle and the ambi- 
ent conventional Newtonian gravitational field — V$jv- 
If the function /i were unity, this would be usual New- 
tonian dynamics. Milgrom assumes that the positive 
smooth monotonic function fi approximately equals its 
argument when this is small compared to unity (deep 
MOND limit) , but tends to unity when that argument is 
large compared to unity (FIG. [3]). The do is a natural 
constant, approximately equal to 10 -10 ms -2 . It is a fact 
that the centripetal accelerations of stars and gas clouds 
in the outskirts of spiral galaxies tend to be below ao- 




FIG. 3: Two popular choices for Milgrom's /j, function: the 
"simple" function j-i(x) — x/(l + x) (solid) and the "standard" 
function /x(x) = x/yl + x 2 (dotted). 

How does the MOND formula (fT]) help? Consider again 
stars or gas clouds orbiting in the disk of a spiral galaxy 
(mass M) with speed V(r) at radius r from its center. 
We must identify \a\ with the centripetal acceleration 
Vlr) 2 jr. Sufficiently outside the main mass distribution 
we may estimate | V<t> jv | ~ GM/r 2 . And at sufficiently 
large r, \a\ will drop below ao and we shall be able to 
approximate fi(x) ~ x. Putting all this together gives 
V(r) < /r 2 ss aoGM/r 2 from which we may conclude two 
things. First, V(r) well outside the main mass distribu- 
tion becomes independent of r, that is, the RC flattens 
at some value Vf. Second, from the coefficients follows 

M = {Gao^Vf 4 . (2) 



Introducing the ratio of mass to luminosity L, T, we have 
the added prediction L = (Ga Y)~ 1 Vf 4 . Although T in 
the blue band varies somewhat with galaxy color, this 
last results explains the TF law, FIG. [2] Theoretically T 
is much less variable in the near infrared (K') band [18| . 
which accounts for the extra sharpness of the TF relation 
in that band (see Fig. 2 in Ref. OB). 
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FIG. 4: The baryonic TF correlation for galaxies spanning 
6 orders of magnitude in mass. This is a log-log plot of the 
total baryonic (stars plus gas) mass Alb in units of the solar 
mass Mq vs. the observed asymptotic rotational velocity Vf 
in km/s. For each of the 60 galaxies from Ref. [h] (deep blue 
circles) the mass in stars comes from a fit of the shape of the 
RC with MOND, whereas for the eight dwarf spirals (light 
blue circles) the mass in stars (relatively small) is inferred in 
Ref. Qi| directly from the luminosity. The green line, with 
slope 4, is MOND's prediction, Eq. ([2]). Graph reproduced 
from Ref. by permission of the American Astronomical 
Society. 

The impressive agreement of Eq. $2$ with observations 
is most clear from the baryonic TF law FIG. [4] In 
full agreement with MOND, the observed baryonic mass 
in spiral galaxies is accurately proportional to the fourth 
power of the asymptotic rotation velocity. MOND's sin- 
gle formula thus unifies the two overarching facts of spiral 
galaxy phenomenology. 

MOND is similarly successful in explaining the detailed 
shapes of RCs. FIG.Oshows the measured RCs of fifteen 
galaxies together with the MOND fits based on surface 
photometry in the optical band and radio measurements. 
The only parameter adjusted in these impressive fits is 
the disk's stellar mass, or equivalently, the stellar mass- 
to-luminosity ratio T. The trend of the so determined 
values of T with galaxy color jibes with that predicted by 
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FIG. 5: MOND fits to measured RCs of a sampie of spiral 
galaxies The galactocentric radius (horizontal axis) is in kpc 
and the rotation velocity (vertical) in km/s. Each solid line is 
the MOND fit to the RC based on the distribution of light and 
neutral hydrogen. The other curves are the Newtonian RCs 
contributed by the gas (H and He) in the disk (short dashed) , 
by the stars (dotted) and by the central bulge (if present, long 
dashed) . The free parameter of the fit is the mass of the stars 
in the disk (and the mass of the bulge if present). Figure 
reproduced from Ref. |1C| with permission, from the Annual 
Review of Astronomy and Astrophysics, (c)2002 by Annual 
Reviews, www.annualreviews.org. 



stellar evolutionary models |T8f ; in this MOND does bet- 
ter than DM halo models [20| • To make similarly success- 
ful fits without just "putting DM where it is needed" , DM 
theorists must adjust two extra parameters apart from T 
in standardized halo models. As stressed in a recent crit- 
ical reappraisal [201 ] . those galaxies where one-parameter 
MOND fits do worse than halo fits often display compli- 
cating factors that could mitigate its less than striking 
performance. In disk galaxies MOND is unquestionably 
more economical, and thus more falsifiable, than the DM 
paradigm. 

When applied to many galaxies, the MOND fits men- 
tioned serve to determine ao with some accuracy; the 
value ao = 1.2 x 10~ 10 ms~ 2 is often used [2l|. It is sig- 
nificant that ao as deduced from RC fits agrees well with 
the value for ao obtained by comparing (GaoT) -1 with 
the empirical coefficient in the TF law: the roles of ao are 
different in the two subjects, and they are tied together 
only in MOND. Milgrom [22j notes a few additional con- 
ceptually distinct roles played by ao in extragalactic as- 
trophysics. An early example is that ao sets a special 
scale of mass surface density S m = clq/G, and wherever 
in a system the actual surface mass density drops be- 
low E m , Newtonian gravitational behavior gives way to 



MOND dynamics ((J. (In galaxies like ours this occurs 
some way out in the disk, and that is why the RC may 
exhibit a brief drop before becoming flat asymptotically) . 
On this basis Milgrom predicted that were disk galaxies 
with surface mass density everywhere below S m to exist, 
they should show especially large acceleration discrep- 
ancies. A population of such galaxies became known in 
the late 1980's [2l|, and all facets of Milgrom's predic- 
tion were subsequently confirmed ^23| . MOND obviously 
predicts that in these so called low surface brightness 
galaxies, the shapes of the RCs, which tend to be rising 
throughout the visible disk, should be independent of the 
precise way /j,(x) switches from x to unity for x ~ 1. 

Constraints of space forbid me from delving into other 
MOND successes, e.g. for elliptical and dwarf spheroidal 
galaxies. But excellent reviews of these and other phe- 
nomenological aspects of MOND exist 0, HHHl- It is 
well, however, to stress one prominent empirical difficulty 
faced by the MOND formula CQ). 

Clusters of galaxies exist which comprise hundreds of 
galaxies of various kinds moving in their joint gravita- 
tional field with velocities of up to 10 3 kms -1 . The New- 
tonian virial theorem (gravitational potential energy of a 
quiescent system equals minus twice its kinetic energy) 
can be used to estimate a cluster's mass since the kinetic 
term is proportional to the total mass while the poten- 
tial one goes like the square of mass. In this way it is 
determined that the total gravitating mass in a typical 
cluster is 5-10 times the mass actually seen in galaxies as 
well as in hot X-ray emitting gas, an oft constituent of 
clusters. A similar story is told by analyses of the hydro- 
statics of the hot gas in light of its measured temperature. 
Finally, the gravitational lensing by several clusters has 
led to estimates of their masses commensurate with the 
mentioned diagnosis. 

When the MOND formula^or even better, a MOND 
analog to the virial theorem [26| , is used to estimate the 
masses of clusters, one finds an improvement but not a 
full resolution to the acceleration discrepancy. Clusters 
still seem to contain a factor of two more matter than ac- 
tually observed in all known forms [1 0f | . The optimist will 
stress that MOND has alleviated the discrepancy with- 
out even once overcorrecting for it; the pessimist [27| 
will view this finding as damaging MOND's credibility. 
But one should keep in mind that clusters may contain 
much invisible matter of rather prosaic nature, cither 
baryons in a form which is hard to detect optically, or 
massive neutrinos 28, 29]. However, the option that clus- 
ters contain non-baryonic cold DM between the galaxies, 
while logically possible, seems hardly justifiable in view 
of MOND's overall philosophy. 



4. What is behind the MOND formula? 

What is the physical basis of MOND's success in unify- 
ing a lot of extragalactic data? There are certain things 
the MOND formula cannot be. First, it is unlikely to be 
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just a recipe for the way DM is distributed spatially in 
astronomical objects, as sometimes proposed [3(|. Such 
proposals strive to explain how the scale eto enters into 
spiral galaxy properties through some regularity in the 
cosmogony of DM halos. But strong arguments exists 
against an intrinsic scale of the halos [31( ■ And it is hard 
to see how such an explanation could give an account of 
the multiple roles of do in different systems (22j . 

The MOND formula, conceived as exact, cannot be a 
generic modification of the inertia aspect of Newton's sec- 
ond law ma = f (here / is the sum of all forces acting 
on the particle including the Newtonian gravity force), 
an alternative proposed originally by Milgrom [6] (see a 
latter elaboration in Ref. I32T K To see why consider a 
binary system with unequal masses mi and rri2 evolv- 
ing under mutual gravity alone. The time derivative of 
m,\V\ +1TL2V2, as calculated from Eq. ([1]), does not vanish 
in general if at least one of the accelerations is compara- 
ble to or smaller than ao, since the /x's will generally not 
be equal. So in the proposed interpretation, the MOND 
formula does not conserve momentum. By the same to- 
ken it does not conserve angular momentum, nor energy. 

Milgrom sidesteps the problem by stipulating that the 
MOND formula is only valid for test particles moving on 
a given background, e.g., stars moving in the collective 
gravitational field of a galaxy. We may inquire more gen- 
erally, does the MOND formula, or some closely related 
one, represent a modification of inertia in test particle 
motion? To comply with the conservation laws we likely 
want the formula to arise from a Lagrangian. The kinetic 
part of the Lagrangian should give us the fi(\a\/ao)a 
part with whatever "corrections" are required. However, 
it is a theorem that no MOND-like dynamics exists that 
simultaneously has a Newtonian limit for ao — * (all 
accelerations are large), is Galilei invariant, and is deriv- 
able from a local action [32[. ("Local" means that the 
relevant Lagrangian can be written as a single integral 
over volume.) This prohibition is even more stringent 
from a relativistic standpoint [HI]. Accordingly Milgrom 
introduced a nonlocal action, i.e. one which is a func- 
tional of complete orbits, but cannot be reduced to an 
integral over a Lagrangian [32| . This approach does not 
quite reproduce formula |T]) generically, but that formula 
is recovered for the special case of circular orbits. 

Of course circular stellar orbits a la MOND is really all 
that one needs to analyze data on spiral galaxies. How- 
ever, it is known that elliptical galaxies comprise highly 
radial stellar orbits too, yet elliptical galaxies also seem 
to be well described by MOND [34], and so the nonlo- 
cal modified inertia approach might be found wanting 
here. On the plus side one may mention the Pioneer 
anomaly [35j]. The Pioneer 10 and Pioneer 11 spacecraft, 
as they travelled almost radially in the outer parts of 
the solar system, were found to be subject to an anoma- 
lous sunward acceleration of order 8 x 10~ 10 ms -2 (not 
far from ao) which does not fall off measurably with dis- 
tance from the sun. Were such an acceleration to reflect 
a generic gravitational field, it would have affected the 



outer planet ephemerides to an intolerable extent [361 ]. 
(An ephemerides is the calculated position of a celestial 
body as a function of time, past or future.) By contrast, 
Milgrom's nonlocal theory does predict different modifi- 
cations of Newtonian dynamics for radial (Pioneer) and 
circular (planetary) orbits, though details have yet to be 
worked out [371 ]. 

Milgrom has also speculated [13, HI] that modified in- 
ertia may have its origin in an effective interaction of 
bodies with the vacuum. This is motivated by the well 
known fact that an object swimming through a fluid has 
its inertial mass increased by interacting with fluid de- 
grees of freedom. Of course, by local Lorentz invariance 
inertial motion through the vacuum is possible as usual. 
However, accelerated motion is distinguished from in- 
ertial motion (even before we come to inertia) by the 
perception that the accelerated system is immersed in a 
thermal bath whose temperature is proportional to the 
acceleration (Unruh radiation [3!|). Accordingly, Mil- 
grom suggests that accelerated motion with respect to 
the vacuum may shape the inertial characteristics of ob- 
jects in a way compatible with MOND. An alternative 
speculation [53, [38| is that MOND style modified iner- 
tia and its particular scale ao may be generated by a 
symmetry in a higher dimensional spacetime, just as we 
think of ordinary (Newtonian or Einstenian) inertia as 
reflecting Lorentz symmetry in Minkowski spacetime. In 
one implementation of this program, with the extremely 
symmetric deSitter spacetime playing the role of the em- 
bedding spacetime, a relation between the cosmological 
constant and ao surfaces, which is actually approximately 
satisfied if the observed acceleration of the Hubble expan- 
sion is interpreted as reflecting a nonzero cosmological 
constant. 



5. MOND as modified gravity 

What if, instead of the modified inertia interpretation, 
we follow Milgrom's alternative proposal 5] to regard 
MOND as a modification of Newtonian gravity? This 
entails a change in the r.h.s. of ma = /, with the new 
gravitational force possibly depending nonlinearly on its 
sources. Can such implementation be protected from vi- 
olation of the conservation laws? 

An affirmative answer to this query is readily avail- 
able by starting from a modification of the Lagrangian of 
Newtonian gravity with Milgrom's ao playing the role of 
characteristic scale, to wit 

Here F is some positive function, p is the matter's mass 
density and $, the field in the theory, is to be iden- 
tified with the gravitational potential that drives mo- 
tion, i.e. a = — V$. The Lagrangian's kinetic part is 
the most general one that depends only on first deriva- 
tives of <E>, and is consistent with the isotropy of space 
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(only |V$| appears). This Aquadratic Lagrangian the- 
ory (AQUAL) [40} reduces to Newton's in the limit 
F(X) — > X, when it readily leads to Poisson's equation 
for $. More generically it yields the equation 

V • [/i(|V$|/a )V$] = 4irGp, (4) 
»WX) = F'(X). (5) 

One identifies the p function here with MOND's p. 

Comparison of the AQUAL equation (j4]) with Poisson's 
for the same p shows that 

^(|V$|/a )V$ = V$tv + V x h, (6) 

where h is some calculable vector field. Now in highly 
symmetric situations (spherical, cylindrical or plane sym- 
metry) , use of Gauss' theorem with the AQUAL equation 
shows that V x h has to vanish. In light of the relation 
a = -V$, AQUAL then reproduces the MOND for- 
mula |T]) exactly. However, the curl term does not usu- 
ally vanish in situations with lower symmetry (Milgrom 
and Brada exhibit a curious exception [HI). We conclude 
that the MOND formula is generically an approximation 
to the solution of the AQUAL equation. 

How good an approximation? Work by Milgrom and 
Brada [ZJ [42j has shown that often the curl is just a 
5-10% correction. This explains how the MOND for- 
mula manages to be so successful while not constituting 
by itself a physically consistent theory. Now almost all 
MOND fits to observed RCs are calculated using that 
formula exclusively. It should thus be clear that modest 
departures of empirical RCs from MOND's predictions 
do not constitute evidence against the paradigm. The 
era of RC modelingwith the full AQUAL Eq. (4} is still 
in its early stages [431 ] . 

I mentioned in Sec. 4 that the MOND formula is in- 
consistent with the conservation laws. The AQUA1 the- 
ory is free of this problem. This needs no special proof 
(although explicit proofs exist [I(|): a theory based on 
a Lagrangian which does not depend explicitly on coor- 
dinates, directions and time (all true for AQUAL's La- 
grangian) automatically conserves momentum, angular 
momentum and energy. At the level of the first integral 
(J6j> of the AQUAL equation, it is the curl term which 
protects AQUAL gravity from violating the conservation 
laws. 

AQUAL consistently embodies the weak equivalence 
principle (all bodies, regardless of inner structure, and 
starting from the same initial conditions, follow the same 
path in a gravitational field). As remarked by Mil- 
grom [5|, the MOND formula is ambiguous in this re- 
spect. In a star accelerations of individual ions greatly 
exceed clq in magnitude. Thus for each ion p, should be 
very close to unity, and we would guess that p w 1 for 
the star as a whole. Accordingly, if regarded as a col- 
lection of ions, the star would follow a Newtonian orbit 
in the galaxy's field. This conclusion is entirely at vari- 
ance with the evidence from the flat RCs, or from the 
dictate of the MOND formula when applied to the star 



regarded as structureless. AQUAL predicts that the cen- 
ter of mass (CM) of a collection of particles, no matter 
what the internal gravitational field, will follow the orbit 
determined by a = — with $ the external potential 
coming from the AQUAL equation [iof . As mentioned, 
this acceleration will approximate that predicted by the 
MOND equation. AQUAL does indeed supply a physical 
basis for MOND. 

A system with sub-ao internal gravitational field im- 
mersed in an external field g e > ao is predicted by 
AQUAL to exhibit quasi-Newtonian internal dynam- 
ics [13]. Originally Milgrom conjectured this "external 
field effect" to explain why no acceleration discrepancy 
is evident in the loose open star clusters residing in the 
Milky Way [5]: the galaxy's field, still stronger than ao in 
its inner parts, suppresses the MOND behavior expected 
in light of the weak internal fields. By contrast, if the said 
system is immersed in an external field g e < ao, which 
is, however, stronger than its internal fields, the internal 
dynamics is predicted to be quasi-Newtonian, but with 
the effective gravitational constant G/ p(g e /ao) [iol ]. 

A useful product of AQUAL is a formula for the force 
between two masses in motion about each other, e.g. a 
binary galaxy. The MOND formula cannot be used di- 
rectly here because, as mentioned in Sec. 4, it would sug- 
gest that the binary's center of mass should accelerate 
(nonconservation of momentum). Instead, $ has to be 
computed from the AQUAL equation, and then pV^ 
has to be integrated over one galaxy's volume to obtain 
the force on it, as effectively done by Milgrom [42| . For 
point masses mi and m2 separated by distance r, the 
force is (Gmim2/r 2 )f(m2/rni,r/r t ) with r t = [G{m\ + 
7712) / 'ao] 1 / 2 ; Milgrom gives the graph of /. In the New- 
tonian limit (r <C rt), f ~ 1. In the deep MOND limit 
(r ^> r t ), Milgrom [26[ shows analytically that the force 
is (2/3)(Ga ) 1 / 2 r- 1 [(mi + m 2 ) 3/2 - m\ /2 - rn 2 ' 2 ]. This 
is most elegantly computed by exploiting the discovery 
that in that limit the AQUAL equation is conformally 
invariant, an approach that also yields the exact virial 
relation in the deep MOND regime [44j . 

AQUAL permits a study of galaxy disc stability in the 
MOND paradigm. I mentioned in Sec. I that Newto- 
nian disk instabilities motivated the idea that DM halos 
surround disk galaxies. In the MOND paradigm there 
is not DM, so how do a fraction of the spiral galax- 
ies in the sky avoid forming bars? Using a mixture of 
MOND formula and AQUAL equation arguments, Mil- 
grom [45| showed analytically that MOND enhances lo- 
cal stability of disks against perturbations as compared 
to the Newtonian situation. Brada and Milgrom [46| 
combined an iV-body code with a numerical solver for 
the AQUAL equation to verify this and to demonstrate 
that MOND enhances global stability. Both works con- 
clude that the degree of stabilization saturates in the 
deep MOND regime; this means no disk in MOND is ab- 
solutely stable. Christodoulou [13] and Griv and Zhyt- 
nikov [481 ] reach similar conclusions with ./V-body simula- 
tions based on just the MOND formula. Recently Tiret 
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and Combes (49j studied evolution of bars in spiral disks 
using TV-body simulations built on the AQUAL equa- 
tion. They find that bars actually form more rapidly 
in MOND than in DM halo Newtonian models, but that 
MOND bars then weaken as compared to those in Newto- 
nian models. The long term distribution of bar strengths 
predicted by MOND is in better agreement with the ob- 
served distribution than that obtained from Newtonian 
halo models. 

AQUAL is also important for studies of two-body re- 
laxation in stellar systems. Two-body relaxation is the 
process whereby distant gravitational encounters of pairs 
of stars in a galaxy or a cluster redistribute the system's 
energy and help it to approach some sort of equilibrium. 
In Newtonian theory two-body relaxation is excruciat- 
ingly slow. But as Binney and Ciotti show [50] , two-body 
relaxation in the framework of AQUAL proceeds faster. 
In fact, if the system is deep in the MOND regime, the 
speedup factor is the square of the acceleration discrep- 
ancy. 

A related effect is dynamical friction, whereby a mas- 
sive object moving through a collection of lighter bodies, 
e.g. a globular star cluster coursing among the stars of 
its mother galaxy, loses energy by virtue of its kicking 
the background objects gravitationally as it sweeps by 
them. As a result the heavy object settles in the grav- 
itational well confining it. In AQUAL for systems deep 
in the MOND regime, dynamical friction is speeded up 
over the Newtonian value by one power of the accelera- 
tion discrepancy [501 ]. 

Both of the above phenomena open a new window 
onto the MOND-Newtonian gravity dichotomy. Suppose 
AQUAL is the correct nonrelativistic description of grav- 
ity. In regard to RCs of spiral galaxies, Newtonian the- 
ory with DM could be "saved" by imagining the DM in 
each galaxy to be distributed in just such a way that the 
resulting total &n is just the <& generated in AQUAL 
theory by the baryonic (visible) matter. Admittedly, 
the required DM distribution might be peculiar (nega- 
tive density somewhere 51], or incompatible with cos- 
mogonic simulations). But leaving these options aside, 
observational uncertainties might make it difficult to tell 
the two theories apart, as recent fits of RCs of nearby 
galaxies based on high quality data testify [HJ . Yet con- 
sideration of the mentioned dissipative effects might re- 
move the effective degeneracy between the two models of 
a galaxy offered by the two theories. 

In view of the generic prediction of Newtonian cos- 
mogonic simulations that the DM distribution in a galaxy 
rises to a cusp at the center (see Sec. I), Newtonian dy- 
namical friction should have caused the globular clus- 
ters of many a dwarf spheroidal galaxy to already have 
merged at its center and there lost their integrity [53|. 
While globular clusters are indeed absent from several 
such dwarf companions of the Milky Way e.g. Draco, 
Sextans and Carina [HiJ , the Fornax dwarf spheroidal has 
five globular clusters near, but not at, its center. Goerdt, 
Moore, et al. [53| regard this as showing that the DM near 



the core of Fornax has a flat density profile, in contradic- 
tion to the DM simulations. In fact, there is other grow- 
ing evidence negating DM density cusps 0, [55[. Now, 
some of the dwarf spheroidal satellites of the Galaxy in- 
cluding Fornax are in the deep MOND regime. Following 
Binney and Ciotti Sanchez-Salcedo, Reyes-Iturbide 
and Hernandez 56] point out that the mentioned speedup 
of settling by dynamical friction in AQUAL should long 
ago have caused the globular clusters in Fornax to amal- 
gamate at its center. AQUAL thus has a problem ex- 
plaining the observations of Fornax; however, as we see 
the DM alternative has its problems too. 

Can MOND give a picture of the late stages of galaxy 
formation? A step in this direction was taken by Nipoti, 
Londrillo and Ciotti on the basis their earlier work 
with TV-body codes incorporating the full AQUAL equa- 
tion (43|. They simulate the dissipationless collapse of a 
cloud of particles which ends up in the MOND or deep 
MOND regimes. They note that phase mixing (the dis- 
appearance of structure in the initial conditions due to 
commingling of particle orbits) proceeds slower than in 
the counterpart Newtonian simulations, and that the fi- 
nal velocity distribution is anisotropic and favors radial 
orbits. Current thought in cosmogony is that elliptical 
and spheroidal galaxies are the outcome of anisotropic 
collapse after star formation is complete. Nipoti et al. 
note that the end-products of some of their simulations 
resemble, in their surface density and velocity dispersion 
profiles, the dwarf-elliptical and dwarf-spheroidal galax- 
ies in the sky. But other final states have trouble fitting in 
all the accepted empirical correlations between luminos- 
ity, radius and velocity dispersion of elliptical galaxies. 



6. Can MOND be made relativistic? 

Why is a relativistic version of MOND important? Al- 
though most astronomical systems with significant ac- 
celeration discrepancy are nonrelativistic in the extreme, 
there are two important exceptions. Cosmology, which 
is important in itself and as a framework for the process 
of galaxy formation, is widely regarded as requiring the 
presence of much DM on the largest scales, and cosmol- 
ogy is universally believed to require relativistic treat- 
ment. Gravitational lensing by galaxies and clusters of 
galaxies has become a salient astronomical tool in the 
last two decades, and, since it discloses acceleration dis- 
crepancies, it is commonly regarded as supporting the 
need for DM. Since light propagates with speed c, non- 
relativistic MOND cannot even begin to formulate the 
problem of gravitational lensing. 

The above cases drove the relativistic formulation 
of AQUAL [H, which I shall refer to as RAQUAL. 
RAQUAL retains Einstein's equations of GR as a tool 
for deriving the spacetime metric g a $ from the energy- 
momentum tensor of the matter (one of whose compo- 
nents represents matter density) . RAQUAL further stip- 
ulates that matter and radiation play out their dynamics, 
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not in the arena of g a p, but in that of g a p — e 2 ^/ c g a p, 
where ip denotes a scalar field with dimensions of squared 
velocity. Since all matter is treated alike in this respect, 
the exquisitely tested weak equivalence principle is safe- 
guarded, but since the metric governing gravitational 
field dynamics is different from the one experienced by 
matter, the strong equivalence principle fails. This is es- 
sential, for otherwise we should end up back with GR 
as the relativistic gravity theory, and GR in the nonrel- 
ativistic limit leads to Newtonian theory, not to MOND. 

How is ip determined in RAQUAL? One takes the La- 
grangian for ip to be the covariant version of that for 
$ in AQUAL, Eq. ©. This means replacing |V$| 2 h-> 
g a ^d a ip dpip as well as d 3 x i— > (— g) 1 ^ 2 d 3 x, where g is the 
determinant of the matrix of components of g a g. But in 
RAQUAL one does not include a term in lieu of /?$; the 
coupling between tp and matter, which is to provide the 
source of the equation for ip in accordance with its La- 
grangian, is automatically generated by the e^/ c factor 
in the metric g a p with which we built the matter's La- 
grangian. For time independent situations with nonrela- 
tivistic matter sources, the ip equation reduces to Eq. 
with $ I— > ip. However, it would be a mistake to conclude 
that in RAQUAL ip is the gravitational potential. 

In GR for weak gravity and nonrelativistic motion, the 
potential in which matter moves, <E>at, is related to the 
metric by <J>at = — §(<?« + c 2 ) (t is the time coordinate). 
Because, in this linearized theory, the most relevant Ein- 
stein equation reduces to Poisson's, $jv coincides with 
the usual Newtonian potential. In RAQUAL g a p still 
satisfies Einstein-like equations, so the above relation ap- 
plies as well. But the stipulation that matter's dynam- 
ics go forward in the metric g a /3 determines the nonrel- 
ativistic potential in which matter moves, now called <3 ) , 
to be $ = —\{gtt + c 2 ). Writing in linearized theory 
g tt = (1 + 2ip/c 2 )gtt, we have $ = $ N - g tt ip/c 2 . Now 
to first approximation g tt — — c 2 ; thus to leading order 

$ = $at + ip. (7) 

Consequently, in RAQUAL the gravitational potential 
traced by the matter's motions is the sum of the Newto- 
nian potential and the AQUAL scalar field, both gener- 
ated by the same baryonic matter. The ip does the job 
of the DM's potential in conventional approaches, but its 
source is ordinary matter. 

Let us compare RAQUAL with the successful AQUAL. 
We take the \x in Eq. (g]) with $ ^ ip to be approximately 
equal to its argument for |V?/>| <C do, and to level off 
at a value hq when |V-0| S> «o- Then for strong |V^| 
Poisson's equations gives ip = &n/ho so that the true 
potential is $ = (1 + l//j,o)^Af- This means gravity is 
Newtonian with effective gravitational constant Gjv = 
(l + l//i )G. For | Vtp\ < a , the RAQUAL equation tells 
us that V^i is much stronger than V<&at. Consequently, 
by Eq. 0, V<f> w V-0 so that in the weak field regime 
RAQUAL's gravitational field is close to AQUAL's. Thus 
in RAQUAL we have a relativistic theory which in the 



nonrelativistic limit inherits the good traits of MOND by 
way of AQUAL. 

However, on the relativistic front RAQUAL hit two 
roadblocks. Already the original paper [4(| remarks that 
perturbations of tp from a static background for which 
|V^i| <C do, e.g. ip waves travelling in the reaches of a 
galaxy, propagate superluminally, i.e., outside the light 
cone of g a fj . Nowadays one hears the view that superlu- 
minality by itself is not sufficient reason for causal prob- 
lems [58j . This is by no means the standard view, and so 
the specter of acausality motivated disillusionment with 
RAQUAL [H, H3| , and formulation of a two-scalar field 
theory, phase-coupled gravity (PCG) [H, [6l| intended 
to prevent supcrluminality. In the PCG Lagrangian the 
new scalar a contributes a quadratic kinetic part, just as 
does ip; there is also a simple coupling between a and 
the kinetic term for tp which gives the theory its MOND 
character. Although PCG generates an intolerable drift 
of the Kepler constant a 3 / P 2 in the solar system, and 
marginally contradicts the observed precession of Mer- 
cury's perihelion [5^], the theory has merits for both 
galaxy dynamics and cosmology 62, 63]. However it may 
be, RAQUAL and PCG were both finally disqualified as 
consistent representations of gravity by their inability to 
cope with observations of gravitational lenses. 

Discovered in 1979, gravitational lenses have been in- 
tensively studied ever since. In strong gravitational lcns- 
ing a cluster of galaxies (but sometimes a single galaxy) 
forms a few images of a single quasar in the background 
by bending the incoming light rays with its gravita- 
tional field. This provided a novel tool for determina- 
tion of masses of large astronomical systems by an ele- 
mentary use of GR. By the late 1980's it was apparent 
that masses of clusters so determined were larger than 
the observed baryonic mass in them, and commensurate 
with the masses determined from motions of the galaxies 
comprising them. The novelty was that the acceleration 
discrepancy was now put in evidence by a relativistic 
phenomenon. Many pounced upon the finding as added 
support for DM's presence in clusters. 

RAQUAL cannot deal with the new evidence; the rea- 
son is instructive. Maxwell's equations are known to 
be conformally invariant: replacement of the metric gap 
with which they are formulated by fix 1 ) g a p does not 
change the form of the equations. It follows that study 
of light propagation cannot distinguish between the met- 
rics g a p and g a p of RAQUAL. Accordingly, although 
Maxwell's equations are supposed to be written with 
g a /3, the metric for studying gravitational lensing might 
just as well be g a p- But this last obeys Einstein's equa- 
tions whose sources are the baryonic matter's energy- 
momentum tensor and that of ip. However, for systems 
as distended as clusters or galaxies, this last is negligi- 
ble. The light bending, then, is predicted by RAQUAL 
to be nearly the same as predicted by GR without DM. 
Yet there is observational evidence that the inner parts 
of clusters gravitationally lense light more strongly than 
would be expected from GR with no DM. The same prob- 
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lem recurs in PCG, in which the two metrics are, again, 
conformally related in the same manner as in RAQUAL. 

It is thus quite clear that if a scalar field is to be the 
vehicle for MOND effects, then regardless of the form 
of its dynamics, the relation between the metrics g a p 
and g a /3 must be non-conformal. I attempted [64| to 
construct such a coupling out of scalar field alone, to wit 



g af 3 = e 2 ^ /c \Ag a p + Bij}, a i>^), 



(8) 



with A and B functions of the invariant g al 4>,a 9- How- 
ever, Sanders and I found that avoidance of propagation 
of gravitational waves which are supcrluminal with re- 
spect to the matter metric g a p requires that B < 0, while 
A > is required so that both metrics have Lorentzian 
signatures [65| . We then calculated that for equal sources 
of Einstein's equations, the light bending in the proposed 
modification of RAQUAL is weaker than that in GR. As 
mentioned, in systems of the scale of galaxies and clus- 
ters of galaxies, the scalar field provides very little energy 
or momentum density as compared to the matter. Thus 
in regard to light ray bending, the new theory performs 
worse than RAQUAL. 

Actually the above conclusion could be avoided if «/> Q is 
a timelike 4-vector. However, we would expect that deep 
inside or near a galaxy, or a massive cluster of galaxies 
which is certainly virialized, the scalar field, whose prin- 
cipal source is the matter therein, should be quasistatic. 
Then 4>,a is expected to be spacelike, i.e. to give lensing 
weaker than needed. The problem is thus still present. 

To overcome it Sanders [66| proposed to replace ip tCt in 
its above role by a nondynamical vector lA a which in an 
isotropic cosmological model points precisely in the time 
direction, and approximately so in the presence of mass 
concentrations. He further related the metrics by 
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Sanders further supposed the action for the scalar field 
</>, which supersedes ip, to be of RAQUAL form, or more 
generally with the function F depending also on the 
scalar U a 4>,a ■ The timelike nature of the vector is con- 
ducive to enhancement of the lensing, and such a theory, 
named stratified theory, can cope with the observations of 
gravitational lensing. However, it was clear to Sanders 
that this is just a toy theory since a globally timelike 
vector field, such as U a , determines a universal preferred 
frame of reference, an aether. Additionally, since the vec- 
tor field is supposed constant, the theory is not covariant. 



7. TeVeS and other relativistic MOND theories 

The mentioned problems are avoided by making the 
vector field a dynamical one, as first done in TeVeS, 
my tensor- vector-scalar theory of gravity [6?], [H, H^]. 
TeVe S retains the relation © between a gravitational 
metric g a j3 and a physical (or observable) metric g a &- 
The matter Lagrangian is built exclusively with g a (3', this 



guarantees that the weak equivalence principle will be 
obeyed precisely. The conventional GR Einstein- Hilbcrt 
Lagrangian is used to give dynamics to g a p, which then 
trivially induce dynamics for g a p. The scalar field <p is 
provided with a RAQUAL type Lagrangian. For conve- 
nience this is couched in terms of a quadratic form in the 
4-gradient of <j> in interaction with a second scalar field, 
<7, which also occurs in a potential-like term. (However, 
no derivatives of a enter, so a is not dynamical, and 
if eliminated, the explicit aquadratic Lagrangian for <p 
shows up.) The mentioned quadratic form is not the 
usual one, but rather (g a P — U a U^)4>. a <$>,$', it is intro- 
duced to forestall the superluminal propagation that af- 
flicts RAQUAL. 

The kinetic part of the lA a Lagrangian is quadratic in 
its first derivatives, and exactly analogous to that for a 
gauge field. This is not the only quadratic form possible, 
but it is the one in which the Einstein equations for g a p 
will not have second derivatives of U a in their sources. As 
mentioned, it is imperative that U a be a timelike vector. 
In TeVe S this is accomplished by introducing, alongside 
the kinetic term, a Lagrange multiplier term of the form 
X(U a U a + 1) which forces the norm of U a to be negative 
(and U a to be of unit length as a bonus). 

Whereas RAQUAL is a one parameter (eto) theory with 
one free function (/x), TeVe S has one free function, F(a 2 ) 
[distinct from the F in Eq. l[5])] that determines the shape 
of the potential-like term, and three parameters in addi- 
tion. One of these is a scale of length I in the cj) La- 
grangian that sets the strength of the potential, while 
a dimensionless one, k, determines the scale of its argu- 
ment. The third parameter, K, also dimensionless, sets 
the strength of the U a Lagrangian. One can form the 
acceleration scale 
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which parallels the role of Milgrom's constant in MOND 
and AQUAL. It is found that the limit of TeVeS with 
k — > 0, K cx k and i cx fc~ 3 / 2 amounts to GR. In fact 
ao — » in this limit, so that all accelerations are to be 
regarded as strong, and gravitation as conventional. 

How about nonrelativistic motion? Let <p c be the co- 
eval value of <j) in the cosmological model in which the 
nonrelativistic system is embedded. This will be the 
asymptotic boundary value of 4> for any local solution 
of the 4> equation. From a linearized version of Einstein's 
equations, and use of the relation ©, it follows [67[ that 
the effective nonrelativistic gravitational potential is 



<E> = S<I>Ar - 
E = (1- K/2)- ' . 



(11) 
(12) 



(this corrects a sign error in Refs. 167] andl68l). Here $^ 
comes from the usual Poisson equation while 4> is found 
from the scalar equation; in both the source is the bary- 
onic mass density p. Thus far all studies have assumed 
that < K < 1 and \<p c \ < c 2 , in which case Eq. (JTTJ) 
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essentially coincides with Eq. ([7]) in RAQUAL. For this 
same reason any time variation of 2 [7pJ should be neg- 
ligible. This is of particular significance for the above 
derived value of ao which, strictly speaking, includes a 
factor S [for example Eq. (6.7) of Ref. |68j. 

For a static situation the equation for <p takes a form 
like Eq. with /i (essentially a 2 ) expressed in terms of 
| V0| through TeVe S's free function F(a 2 ). By a suitable 
choice of F one can reproduce any fi function that would 
be relevant in AQUAL, with the scale thereon agreeing 
with ao in Eq. (jlOp . By analogy with our discussion 
of RAQUAL we see that TeVe S can have MOND phe- 
nomenology and also a Newtonian limit. The transition 
between Newtonian and MOND regimes takes place as 
|V(/>| sweeps through the value ao, near the point where 

|V$| S3 ao. 

Actually for systems strongly departing from spheric- 
ity, the situation is not as clear, but it seems that at 
the nonrelativistic level TeVe S implements the MOND 
paradigm overall. When it comes to details, my origi- 
nal choice for F [(57j can be criticized: while it leads to 
flat outer RCs, it does not give a satisfactory account of 
the transition part of the RCs of several well measured 
galaxies, including our own [7l], [t3] - By contrast, the 
MOND formula with the "simple" variant of Migrom's 
[i (FIG. El) leads to very good fits for many RCs [z3|. 
Forms of the TeVe S function F which would do a cor- 
respondingly good job have been proposed by Zhao and 
Famaey (73|, Sanders [72j and Famaey, et al. [70] |. 

Sanders has also proposed a variation on TeVe S in- 
volving a second scalar a with PCG-like dynamics in- 
stead of RAQUAL ones 74[ . This bi-scalar tensor vec- 



tor theory (BSTV) has three free functions and a free 
parameter. The theory is not especially needed to obvi- 
ate superluminal propagation since TeVe S seems to do 
well on this [67[. However, BSTV, like PCG, is a more 
appropriate frame for generating cosmological evolution 
of ao- Since numerically ao ~ cHq, it is often argued 
that ao must be determined by cosmology, and should 
thus vary on a Hubble timescale [H, [l(| • As mentioned, 
ao in TeVe S is essentially set by the parameters i and fc; 
it should be nearly constant in the expanding universe. 
It is otherwise in BSTV. Discrimination between con- 
stant and evolving ao may be possible with good RCs of 
disk galaxies at redshifts z = 2 — 5. Such curves are just 
now coming into range. In fact, the data in Ref. [75| put 
the z = 2.38 galaxy BzK-15504 right on the MOND de- 
rived Tully-Fisher law with the standard value for ao [zE| • 
Thus the meagre data available today are consistent with 
no ao evolutions on the Hubble timescale. 

Recently Zlosnik, Ferreira and Starkman [77| have 
clarified the relation between TeVe S and the so called 
Einstein- Aether gravity theories in which a timelike unit 
vector field (but no scalar) plays a role alongside the met- 
ric [78[. They do this by re-expressing TeVeS solely 
in terms of the observable metric g a p and the vector 
field. The (f> is eliminated with the help of the constraint 
imposed through the Lagrange multiplier term. There 



emerges an Einstein- Aether like theory in which the met- 
ric g a p also satisfies Einstein-like equations. However, in 
contrast to orthodox Einstein-Aether theories, the vec- 
tor kinetic action in the theory in question is a generic 
quadratic form in the vector's derivatives contracted into 
polynomials of the vector's components. This form of 
TeVe S is more complex than the original one, but there 
are some circumstances in which it may be more conve- 
nient in ferreting out consequences of the theory. 

In TeVe S in either form the vector is normalized with 
respect to g a p, not g a p. Zlosnik, et al. have also proposed 
a variant tensor- vector theory with a timelike vector nor- 
malized with resp ect to g a /s, and which also has MOND 
like behavior [791 ] . This is constructed by taking the vec- 
tor's action as a function T of /C, the quadratic form in 
the derivatives of the vector field of orthodox Einstein- 
Aether theories. The Zlosnik, et al. theory has four pa- 
rameters: a length scale and three dimensionless parame- 
ters. The form of the free function T{JC) can be deduced 
approximately from the requirement that MOND arise in 
the nonrelativistic quasistatic limit, and from the stipu- 
lation that a cosmology built on this theory shall have an 
early inflationary period and an accelerated expansion at 
late times. 

In summary, the relativistic implementations of 
MOND involve either one or three free functions; those 
with one free function have either three or four free pa- 
rameters. 



8. TeVe S and gravitational lensing 

As stressed earlier, the particular structure of TeVe S 
reflects the desire to encompass in the MOND paradigm 
the observation that what passes, from a dynamical point 
of view, for DM in galaxies and clusters of galaxies also 
lenses light to a commensurate degree. How does TeVe S 
measure up to the task? Now the measured gravitational 
lensing by galaxies and clusters of galaxies takes place 
over cosmological distances. The light rays in TeVe S 
are null geodesies of the observable metric g a p ; to com- 
pute g a p one needs U a ,(j) and g a p. Isotropic cosmologi- 
cal models in TeVe S closely resemble the corresponding 
cosmological models of GR; they sport a U a pointed pre- 
cisely in the time direction and feature very slow change 
of <p [13 ■ A consequence is that g a p in a TeVe S isotropic 
cosmological model differs little from the metric in the 
corresponding GR cosmology. Thus the cosmological 
facet of gravitational lensing is very much like in GR. 

The second facet is the local bending of light rays in 
a mass' vicinity. As in isotropic cosmological models, so 
in static situations, the solution of the vector's equation 
has U a pointed precisely in the time direction. To com- 
pute the bending in linearized theory, the approximation 
commonly used in the business, one also needs the scalar 
field 4> and the metric g a f3, both to first order in <&at. The 
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final result is that the line element takes the form 

d~s 2 = -(1 + 2<£/c 2 )dt 2 + (1 - 2$/c 2 )(dx 2 + dy 2 + dz 2 ) 

(13) 

with $ given by Eq. (fTTj) . Note that the same poten- 
tial $ appears in both terms of this isotropic form of the 
line element. Hence light ray bending, which leans on 
both to equal degree, measures the same gravitational 
potential as do dynamics which are sensitive only to the 
temporal part of the line element. This mirrors the sit- 
uation in GR whose line element is obtained by sending 
$ i-> $ w in Eq. (fT3|) . Thus half of the problem that 
plagues RAQUAL and similar theories is overcome: the 
acceleration discrepancy makes itself felt equally through 
the gravitational lcnsing as through the dynamics. 

The second half of the problem revolves about the mass 
distribution that generates In GR $ is all there is, 
and its Laplacian, as determined from the lensing obser- 
vations or from the dynamics, will give the total (bary- 
onic plus dark) mass distribution directly. Because DM 
is not seen directly, this prediction can only be judged 
by the plausibility of the derived distribution of DM. By 
contrast in TeVe S the measured $ is to be decomposed 
into two parts in the manner of Eq. (|11[) . with each gen- 
erated by the same baryonic mass density p, one part 
through Poisson's equation, and the second through a 
highly nonlinear AQUAL type equation. Evidently, in a 
TeVe S model of a gravitational lense, the baryonic mat- 
ter will be distributed differently from the total matter 
in a GR model of the same lense. And when the lensing 
system is not spherically symmetric, the centers of the 
two distributions may be offset. 

Chiu, Ko and Tian [8(| have explored light ray bend- 
ing by a pointlike mass M in TeVeS. They note that 
the deflection angle in the deep MOND regime [impact 
parameter b bo = {k / ^){GM / ao) 1 ^ 2 ] approaches a 
constant, as might have been expected from naive argu- 
ments, but is less predictable in the intermediate regime 
b ~ bg. Thus calculations of lensing based on a mixture 
of MOND and GR motives [8l|, [82| can easily mislead. 
Chiu et al. work out the lens equation in TeVeS, which 
controls the amplifications of the various images in the 
strong lensing of a distant source, and remark that for 
two images the difference in amplifications is no longer 
unity as in GR, and may depend on the masses. With a 
photometric survey like the Sloan Digital Survey it may 
be possible to check for this effect. Finally, these authors 
work out the gravitational time delay in TeVeS, which 
could be of use in interpreting differential time delays in 
doubly imaged variable quasars. 

More phenomenologically oriented, Zhao, Bacon, Tay- 
lor and Home 83] compare TeVe S predictions with a 
large sample of galaxy strong lenses which each produce 
two images of a quasar. They model the galaxy baryonic 
mass distribution either by point masses or by the pop- 
ular Hernquist profile. Galaxy masses are estimated by 
comparing observations both with predicted image posi- 
tions and with predicted amplifications ratios; the two 
methods are found to give consistent results, themselves 



well correlated with the luminosities of the galaxies. The 
corresponding mass-to-light ratios are found to be in the 
normal range for stellar populations, with some excep- 
tions. 

How frequently should strong lensing occur? This 
question is taken up in the context of TeVe S by Chen and 
Zhao [U- Again modeling the mostly elliptical galaxies 
with Hernquist profiles, they compute the probability of 
two images occurring as a function of their separation. 
Their prediction falls somewhat beneath the frequency 
observed in the lensing surveys, though they consider this 
still acceptable. The predictions are sensitive to the as- 
sumed mass profile, as well as to the assumed shape of 
the fj, function, which Chen and Zhao assume to switch 
brusquely from linear in the argument to unity. 
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FIG. 6: The colliding clusters 1E0657-56. The bullet clus- 
ter (right) rammed through the cluster on the left. Hot gas 
stripped off both clusters is colored red-yellow. Green and 
white curves are level surfaces of gravitational lensing conver- 
gence; the two peaks of this do not coincide with those of the 
gas which makes up most of the visible mass, but are skewed 
in the direction of the galaxy concentrations. The white bar 
corresponds to 200 kpc. Figure reproduced from Ref. [85| by 
permission of the American Astronomical Society. 

Gravitational lensing by the colliding galaxy clusters 
1E0657-56 has been claimed to give theory independent 
proof of DM dominance at large scales [82| • In this sys- 
tem (FIG. [6]) a smaller cluster, the "bullet" , has crashed 
through a larger one and the intracluster gas of both has 
been stripped by the collision, the bullet's gas trailing be- 
hind its galaxy component. Weak lensing (distorted but 
unsplit images) mapping shows the lensing mass to be 
concentrated in the two regions containing the galaxies, 
rather than in the two clouds of stripped gas which con- 
tains the lion's share of baryonic mass [82|, |85( . Collision- 
less DM would indeed move together with the galaxies. 
Hence the inference that much DM continues to accom- 
pany the bullet. Angus, Famaey and Zhao [8(| (see also 
Ref. |73| ) point out that in the very asymmetric system 
1E0657-56, MOND. AQUAL and TeVe S all predict sub- 
stantially different gravitational field distributions (com- 
pare Eqs. |1} and ©), a situation which confuses Clowe 
et al.'s "theory independent" inference. 
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Whereas Angus, Famaey and Zhao consider it pos- 
sible to explain the lensing with a reasonable purely 
baryonic matter distribution, a later paper by Angus, 
Shan, et al. [87| concludes that dark matter is needed 
after all. This is hardly surprising; as we saw in Sec. 3, 
pure MOND does not fully account for the acceleration 
discrepancy in the dynamics of quiescent galaxy clus- 
ters [lOj . But DM models of the bullet clusters within GR 
are not without their problems. Farrar and Rosen [88| 
note that the relative velocity of the clusters is too high 
as compared to those seen in DM simulations of struc- 
ture formation. To remove the contradiction they pro- 
pose that a non-gravitational attraction of a new sort 
acts only between clumps of DM. But is assuming ex- 
istence of DM together with a new interaction specific 
to it more parsimonious than a modification of standard 
gravity such as MOND? 



9. TeVe S and cosmology 

With my original choice of F, TeVe S cosmologi- 
cal models with baryonic matter content alone can be 
very similar to the corresponding GR models by virtue 
of the scalar (A's energy density remaining relatively 
small H3, . In particular Chiu et al. [sy] note that 
these TeVe S models give a reasonable relation between 
redshift and angular distances, and are thus as effective 
as GR models in providing the scaffolding for the analysis 
of cosmologically distant gravitational lenses. How would 
changing F affect the evolution of a cosmological model? 
This is studied exhaustively by Bourliot et aL_j89| follow- 
ing an earlier exploration by Skordis et al. [901 ] . Bourliot 
et al. display a large set of variants of my F for which, 
while the scalar energy density may not be negligible, it 
tracks or mimicks the behavior of another energy compo- 
nent of the model, e.g. the radiation's during radiation 
dominance. They also characterize shapes of F which 
can lead to future singularities in cosmology, and which 
presumably should be avoided. 

MOND critics have always held up the complicated 
power spectrum of cosmological perturbations (back- 
ground radiation or baryons) as a proof that DM is 
needed on the cosmic level; after all the spectrum is said 
to be well fit by the "concordance" DM model of the uni- 
verse. This argument took for granted that MOND could 
never measure up to the test. With TeVe S on the scene 
one can face the question technically. 

In a massive work Skordis has provided the full co- 
variant formalism for evolution of cosmological pertur- 
bations in TeVe S [9l| . And using this Skordis et al. [90| 
have shown that, without invoking dark matter, TeVe S 
can be made consistent with the observed spectrum of 
the spatial distribution of galaxies and of the cosmic mi- 
crowave radiation if one allows for contributions by mas- 
sive neutrinos (in the still allowed mass range) and the 
cosmological constant. In this approach the role of dark 
matter in standard cosmology is taken over by a feed- 



back mechanism involving the scalar field perturbations. 
Dodclson and Liguori [92| have independently calculated 
perturbation growth, and stressed that it is rather the 
vector field in TeVe S which is responsible for growth of 
large scale structure without needing DM for this [92| . It 
thus seems there is potential in TeVe S to do away with 
cosmic DM, as well as with DM in galaxies. 

Much attention is commanded today by the mystery 
of the "dark energy" , the agent responsible for the ob- 
served acceleration of the Hubble expansion. Such agent 
is obligatory in the context of GR cosmological models. 
Quite naturally many have wondered whether TeVe S 
could obviate this need. Diaz-Rivera, Samushia, and Ra- 
tra [93[ have found exact deSitter solutions of TeVeS 
cosmology which can represent either early time infla- 
tion epochs or the late time acceleration era. In another 
TeVe S study, Hao and Akhoury [94| have concluded that 
with a suitable choice of the TeVe S function F, the scalar 
field can play the role of dark energy. And Zhao 19511 
maintains that with Zhao and Famaey's choice of F [73| , 
cosmological models can be had that evolve at early times 
like those of standard cold dark matter cosmology, and 
display late time acceleration with the correct present 
Hubble scale, all this without assuming DM or dark en- 
ergy. For the related Einstein-Aether theory, Zlosnik et 
al. [79| remark that with suitable choice of their theory's 
T , the vector field can both drive early inflation as well 
as double for dark energy at late times. 



10. Assessment of relativistic MOND 

It may be unnecessary to go far from Earth to tell 
TeVeS, BSTV and similar theories apart from GR; the 
solar system can be turned into a sieve for the correct 
modified gravity. Between any two bodies in the so- 
lar system there is an extremum point, strictly a saddle 
point, of some relevant field. In AQUAL this would be 
the <E> potential and in TeVe S the (f> field. Near such 
a point the gradient of the said field is small, and de- 
partures from Newtonian gravity are significant (/j, in 
AQUAL falls short of unity, and the derivative of F 
in TeVe S is driven away from the pole which signals 
Newtonian behavior). A detailed study by Magueijo and 
me [%[ with TeVeS shows that the anomalous regions, 
e.g. between Sun and Jupiter and Earth and Moon, are 
small, but gives some hope that with the fine guidance of 
space probes like ESA's LISA-Pathfinder (scheduled for 
liftoff in October 2009), it may be possible to diagnose 
MOND-like effects. 

Well away from the saddle points, but still in the re- 
gion occupied by the planets, TeVe S predicts small de- 
partures of (j) from 1/r form, i.e. small departures from 
Newtonian behavior as encoded in <!> [67| . Similar pre- 
dictions issue from BSTV 



72J . These are constrained by 



the observed constancy of Kepler's constant a / P out 
to the orbits of the major planets, and by the absence 
of significant departures from GR's predictions for the 
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precessions of the perihelia of Mercury and the asteroid 
Icarus. According to Sanders, TeVe S can satisfy these 
constraints with choices of the function the F proposed 
by Refs. [zl and [zl, which, as mentioned in Sec. 7, also 
lead to correct galaxy RCs. BSTV can do just as well 
when its extra parameter, e, is chosen properly [7^ |. 

Both TeVe S and BSTV with the above choices pre- 
dict that beyond 100 astronomical units (AU) from the 
sun, there is a fairly r-independent attractive component 
of the sun's gravitational field of strength ~ 0.3a . The 
Pioneer anomaly, as measured at distances beyond 20 
AU, is also r-independent but stronger. Is this evidence 
for MOND? Unfortunately, the interpretation of the Pio- 
neer anomaly as an almost constant sunward acceleration 
seems to clash with limits on the variation of Kepler's 
constant set with spacecraft at the distance of Uranus 
and Neptune [zl, [97 1 a nd with the latest ephemerides 
of the outer planets 36| . It thus seems prudent to sus- 
pend judgment until a prosaic origin for Pioneer anomaly, 
e.g. drag by unknown matter, can be excluded with high 
probability. 

The above focuses on the nonrelativistic limit. How do 
relativistic MOND theories fare with regard to the low- 
est order relativistic effects in the solar system? As for 
any metric theory of gravity, these effects can be calcu- 
lated from the post-Newtonian (PPN) parameters of the 
theory of choice. For any metric theory it is possible to 
parametrize the first and second order departures of the 
metric from Minkowski form in terms of a set of ten in- 
tuitive looking potentials [98[. The Newtonian potential 
<&at figures in the list, and occurs in the corrections both 
linearly as well as squared; two others are 



* 2 = / ^')*"fr'W 



Vi 



\r — r 



(14) 

where v denotes the fluid velocity in the system. The 
post-Newtonian (PPN) parameters are the dimensionless 
coefficients multiplying the diverse potentials in the cor- 
rection. Some examples will clarify this. 

The correction to the space-space part of the metric, 
gij, is 2j^pf5ij; it defines the PPN parameter 7. The 
correction to the temporal metric component, g t t, starts 
with the terms 2§ N — 2/3$jv 2 , which define the PPN pa- 
rameter /3, and also includes a $2 term whose coefficient 
brings in a further PPN parameter, (1. Each time-space 
metric component gu gets corrections proportional to the 
Vi and Wi§N (w denotes the velocity of the chosen co- 
ordinate system with respect to the cosmological matter 
comoving frame). With these last terms come two ad- 
ditional PPN parameters, a% and a^, which are called 
preferred frame parameters. A third one, 0:3, is associ- 
ated with a correction of the form k; 2 $jv to gu- There 
are four more PPN parameters for a total of ten. 

The parameters j3 and 7 are both unity for GR and 
for some of it competitors. They have also been com- 
puted to be unity in TeVe S [67], [9i| , which thus fares 
as well as GR in reference to gravitational light bending, 
perihelia precessions of the planets, and the radar time 



delay. Sanders [72| has provided a simple argument that 
(3 and 7 are always unity in a class of theories (including 
BSTV) which start from the disformal relation ©. 

Next in order of relevance are the preferred frame pa- 
rameters ai,a2, and 013. These vanish in GR; after all, 
GR has no preferred frames. In TeVe S or BSTV at least 
one of the as should be nonvanishing because the vector 
U a establishes a locally preferred frame, with the vector 
pointing out the time direction in that frame. Sanders 
argues heuristically that the as should be strongly sup- 
pressed in both theories [72j]. But explicitly computing 
the a's for relativistic MOND theories should be a high 
priority because they are subject to tight experimental 
bounds. 

The odyssey in search of a relativistic embodiment 
of the MOND paradigm has led, not to one relativis- 
tic MOND, but to many. TeVeS by itself exemplifies 
a family of theories. First there is the freedom in the 
choice of the function F, which is still only modestly 
constrained [ZlL [zl, [zH • Next, the coefficient of the sec- 
ond term in the factor (g a P —U a U^)(f)^ a (f) @ in the scalar's 
Lagrangian can be changed to any other negative num- 
ber. In TeVe S the Lagrangians for <f> and lA a are formu- 
lated upon the metric g a p] a different but related theory 
emerges if one uses instead the background of g a p. Ad- 
ditionally one can replace the aquadratic Lagrangian for 
by a PCG-style Lagrangian for two scalars, as indeed 
done in Sanders' BSTV. Finally, one can altogether dis- 
pense with the scalar fields and go the way of the Zlosnik- 
Ferreira-Starkman theory [zl|. 

While the lack of uniqueness of relativistic MOND is 
a nuisance, it does add needed flexibility to the search 
for a "final" fundamental theory with which to underpin 
the MOND paradigm. It is already clear that some of 
the above mentioned theories may not be in full accord 
with the facts. For example, the published TeVe S has an 
exact MOND limit at low accelerations, yet as mentioned 
in Sec. 3, MOND cannot handle the dynamics of clusters 
of galaxies without invoking additional unsee n ma tter. 
An interesting escape is suggested by Sanders jlOOl ]. In 
BSTV cosmology, just as in the PCG one [6^, the a field 
can undergo oscillations which generate bosonic particles 
early on. If the BSTV parameters are right, these bosons 
can be trapped by clusters (but not by galaxies), and 
can thus comprise the additional unseen matter. The 
charm of this resolution is that this new cold dark matter 
emerges from the modified gravity theory itself, and is 
not a separate invention. 

Relativistic MOND as here described has developed 
from the ground up, rather than coming down from the 
sky: phenomenology, rather than pure theoretical ideas, 
has been the main driver. Actually a large industry flour- 
ishes on the sidelines with imaginative ideas from first 
principles regarding the essence of MOND. I have not 
touched here on these motley approaches because they 
have given so little that is observationally viable. Neither 
have I dwelt here on modified gravity theories that are 
not MOND motivated or oriented. But the time may be 
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ripe for turning to a more deductive approach to MOND. 
Now that we have some idea of what constitutes a viable 
relativistic MOND theory, it should be easier to single 
out theoretical frameworks which might yield a promis- 
ing candidate for the fundamental MOND theory either 
as a limiting case, as an effective theory, after dimensional 
reduction, etc. 

To give an example, let us recall that Brans-Dicke mod- 
ified gravity plus Maxwellian electromagnetism in the 
real world can both be recovered by dimensional reduc- 
tion of 5-D Einstein gravity theory. TeVe S has the same 
number of degrees of freedom as that pair of theories; 
specifically, its vector field has three degrees of freedom 
on account of the normalization condition, and the elec- 
tromagnetic vector potential also has three on account of 
gauge freedom. Might some variant of TeVe S arise from 
dimensional reduction of a pure gravity theory in 5-D? If 



so, this would both ameliorate the common feeling that 
TeVe S is unduly complicated, and point the way to a 
lode for theories which might do better justice to the 
observations in the spirit of the MOND paradigm. 
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